Some Design Aspects for VLIW Architectures Exploiting Fine - Grained Parallelism
نویسنده
چکیده
Very Long Instruction Word Architectures (VLIW architectures) can exploit the ne{grained (instruction level) parallelism typically found in sequential{natured program code. A parallelizing compiler is used to restructure the program code. Sophisticated global compaction techniques have emerged that can e ectively extract ne{grained parallelism from ordinary sequential natured program code. In this paper we propose an e ective mechanism for multiway branches and introduce a generalized conditional execution model for VLIW architectures. For the evaluation of VLIW architectures and their parallelizing compilers we use a simulation environment. This simulation environment comprises a parallelizing compiler and a highly con gurable simulator for VLIW architectures. With this simulation environment the architectural enhancements proposed in this paper can be evaluated. Our studies are directed in nding high performance combinations of VLIW architectures and parallelizing compilers.
منابع مشابه
Balancing Fine- and Medium-Grained Parallelism in Scheduling Loops for the XIMD Architecture
This paper presents an approach to scheduling loops that leverages the distinctive architectural features of the XIMD, particularly the variable number of instruction streams and low synchronization cost. The classical VLIW and MIMD architectures have a fixed number of instruction streams, each with a fixed width. A compiler for the XIMD architecture can exploit fine-grained parallelism within ...
متن کاملStream Execution on Embedded Wide-Issue Clustered VLIW Architectures
Very long instruction word(VLIW-) based processors have become widely adopted as a basic building block in modern Systemon-Chip designs. Advances in clustered VLIW architectures have extended the scalability of the VLIW architecture paradigm to a large number of functional units and very-wide-issue widths. A central challenge with wide-issue clustered VLIW architecture is the availability of pr...
متن کاملExploiting loop-level parallelism on coarse-grained reconfigurable architectures using modulo scheduling - Computers and Digital Techniques, IEE Proceedings-
Coarse-grained reconfigurable architectures have become increasingly important in recent years. Automatic design or compilation tools are essential to their success. A modulo scheduling algorithm to exploit loop-level parallelism for coarse-grained reconfigurable architectures is presented. This algorithm is a key part of a dynamically reconfigurable embedded systems compiler (DRESC). It is cap...
متن کاملExploiting Multi - Grained Parallelism for Multiple - Instruction - Stream Architectures
Exploiting parallelism is an essential part of maximizing the performance of an application on a parallel computer. Parallelism is traditionally exploited at two granularities: individual operations are executed in parallel within a processor to exploit instruction-level parallelism and loop iterations or processes are executed in parallel on different processors to exploit loop-level paralleli...
متن کاملAutomatically Customising VLIW Architectures with Coarse Grained Application-Specific Functional Units
Instruction Level Parallelism (ILP) machines, such as Very Long Instruction Word (VLIW) architectures, and customised architectures are two paradigms that are used to increase the performance of processors. While a VLIW machine has multiple functional units, a customised processor is equipped with Application-specific Functional Units (AFUs). Customisation has been proved beneficial on single i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993